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A commentary on 

Differences in visual motion representa- 
tion between cortical areas MT and MST 
during ocular following responses 

by Kenichiro, M., Naoko, I., Yuki, A., 
and Kenji, K. (2014). J. Neurosci. 34, 
2160-2168. doi: 10.1523/JNEUROSCI.3797- 
13.2014 

Estimating object motion within dynamic 
visual scenes is a critical skill for ani- 
mals. In primates, this is supported by 
cortical neurons identified in Primary 
Visual Cortex (VI) and so called motion 
areas Middle Temporal (MT) and adja- 
cent Medial Superior Temporal area (MST, 
which receives MT inputs) of macaques 
(Van Essen and Maunsell, 1983). Previous 
work has uncovered neural mechanisms 
serving motion direction estimation but 
there remain gaps in understanding the 
computation and representation of speed 
in the brain and the roles MT and MST 

Play- 
Recently published work (Miura et al., 
2014) characterized the response proper- 
ties of MT and MST neurons of the rhe- 
sus monkey during a brief presentation 
of moving grating stimuli. The large field 
stimulation evoked fast (~55 ms in mon- 
key) reflexive ocular following responses 
(OFR; Miles et al., 1986) during which 
spiking rates of neurons in the two areas 
were measured. OFR is thought to rely 
on fast motion estimation computations 
which occur in part in MT and MST to sta- 
bilize gaze by driving the eyes at a speed 
and direction dependent on the stimu- 
lation (Kawano et al., 1994; for review 
Masson and Perrinet, 2012). Thus, OFR 



accurately reflects early activity through 
very few synaptic connections in the corti- 
cal motion pathway, making it an excellent 
behavioral probe of sensory processing. 

Miura et al. (2014) recorded neural 
responses within a 60 ms window start- 
ing 20 ms before eye movements. Using 
a luminance grating stimuli they directly 
probed how MT sensitivity related to 
MST sensitivity before comparing both 
to behavioral responses. By recording 
spatiotemporal frequency responses in 
different cells during OFR, they eluci- 
dated speed estimation and representa- 
tion mechanisms. Their paradigm exploits 
Fourier Theory, which states that any sig- 
nal comprises a set of sinusoidal gratings 
added together each with different ampli- 
tudes. Thus, testing a range of discreet 
points of stimulus temporal and spatial 
frequency parameters defines experimen- 
tally targeted scales linking behavioral task 
performance to neural computation per- 
formed by units with specific spatial and 
temporal scales of sensitivity or receptive 
fields. This stimulus constrains the eco- 
logical interpretation as sine waves are 
not typically seen in natural scenes so 
the assumption of linearity critical in 
the Fourier approach must be considered 
when generalizing findings to more com- 
plex scenes. The authors fitted the dis- 
creetly sampled points of spatio-temporal 
responses of each MT and MST unit with 
a two dimensional function composed of 
spatial and temporal Gaussian terms. This 
allowed them to estimate the frequency 
response profile of each unit and use a 
regression of these to estimate the popu- 
lation response in each area. Interestingly 
MST is seen to have a broad spatial 



frequency sensitivity extending to lower 
frequencies where there is little or no sensi- 
tivity in MT. They suggest that MT extracts 
motion at finer scales while MST is respon- 
sible for coarser scales. However, both 
showed similar optimal temporal frequen- 
cies around 20 Hz. As both structures have 
parallel outputs to the Pontine Nucleus 
which drives OFR, this leads to the ques- 
tion: does the critical locus of this driving 
role depend on the scales of stimulation? 

In the fitting estimating unit response 
profiles used by Miura et al. (2014), the 
temporal Gaussian contained a parame- 
ter Q that quantified the diagonal ori- 
entation of the response in frequency 
space. Diagonally oriented or insepara- 
ble responses (Q = 1) indicate speed sen- 
sitivity independent of spatial features. 
Conversely, profiles elongated across spa- 
tial frequencies showing a constant opti- 
mum temporal frequency are separable 
(Q~0), characteristic of temporal fre- 
quency tuning. Units showed a continuum 
of Q values; MT had a median of 0.25 
making it more speed tuned than MST 
at 0.07. The low values surprisingly imply 
that most units in these motion areas have 
speed sensitivity that depends on spatial 
features. A comparison of this MST result 
to VI population Q values of 0.08 (Priebe 
et al., 2006) shows similar speed tuning in 
MST and generic visual area VI. Previous 
work using a different method to estimate 
separability found a higher proportion 
of MT units (~0.25) to be speed tuned 
(Perrone and Thiele, 2001). Separability 
estimates depend on how the sparsely sam- 
pled frequency space is fitted with a 2D 
function and the use of symmetric func- 
tions by Miura et al. influenced estimates 



Frontiers in Computational Neuroscience 



www.frontiersin.org 



August 2014 | Volume 8 [ Article 92 | 1 



Meso and Simoncini 



The cortical locus of visual speed estimation 



(Perrone, 2006). The frequency bandwidth 
of units makes them sensitive not just to 
sine waves but to features like edges ubiq- 
uitous in natural scenes. Neural record- 
ings over longer presentations (> 700 ms 
compared to 60 ms) found the tuning of 
initially less separable units to be more 
separable under broadband stimulation 
(Perrone and Thiele, 2001; Priebe et al, 
2003). Human psychophysics experiments 
with presentations times comparable to 
Miura et al. (2014) found that increas- 
ing frequency bandwidth strengthens OFR 
reducing variability, but degrades speed 
comparison judgments (Simoncini et al, 
2012). While presentation time effects and 
task specificity remain unclear, speed esti- 
mation under natural stimulation seems 
to exploit the disambiguating effects of 
broadband frequencies (Priebe et al., 2006; 
Meso and Zanker, 2009). 

Miura et al. (2014) filled a void in the 
literature by characterizing MST frequency 
responses and comparing that with MT 
to reveal differences in preferred spatial 
scales. They mapped out the OFR spatio- 
temporal frequency responses finding sen- 
sitivity similar to MST, but shifted toward 
higher frequencies. They showed that the 
OFR profile could be reconstructed by a 
linear summation of MT and MST pro- 
files, suggesting both contribute through 
parallel and not serial computations to 
drive OFR. This raises the question: What 
might the roles of MT/MST be when OFR 
is replaced by or accompanied by percep- 
tual judgments? 

There remain open questions about 
characteristic separability and specific roles 
of both areas when processing natu- 
ral scenes. Theoretical work continues to 
resolve how unit outputs from an MT 
under broadband stimulation might be 



transformed to resemble MST units and 
better predict behavior (Perrone, 2012). 
Human speed perception and action 
(OFR) seem to involve different infor- 
mation integration strategies (Simoncini 
et al., 2012). There is much work left 
to standardize methods for characteriz- 
ing sparsely sampled frequency responses 
and compute separability, understand pro- 
cessing under naturalistic stimulation and 
probe the parallel pathways and their 
implications for perception and action 
computations. 
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